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Abstract. Given a C 1+7 hyperbolic Cantor set C, we study the sequence C n ,x of Cantor subsets which nest 
down toward a point x in C. We show that C n ,x is asymptotically equal to an ergodic Cantor set valued process. 
The values of this process, called limit sets, are indexed by a Holder continuous set-valued function defined on D. 
Sullivan's dual Cantor set. We show the limit sets are themselves C fc+7 , C°° or C w hyperbolic Cantor sets, with 
the highest degree of smoothness which occurs in the C 1Jrl conjugacy class of C. The proof of this leads to the 
following rigidity theorem: if two C k+1 , C°° or hyperbolic Cantor sets are C 1 -conjugate, then the conjugacy 
(with a different extension) is in fact already C k+ ~< , C°° or C w . Within one C 1+7 conjugacy class, each smoothness 
class is a Banach manifold, which is acted on by the semigroup given by rescaling subintervals. Conjugacy classes 
nest down, and contained in the intersection of them all is a compact set which is the attractor for the semigroup: 
the collection of limit sets. Convergence is exponentially fast, in the C 1 norm. 



Introduction. 

Consider the sequence C n>x of Cantor subsets which nest down toward a point x in a hyperbolic Cantor 
set C C [0, 1], and which have been affinery rescaled to have left and right endpoints at and 1. We wish to 
describe how the geometry of these sets changes as n increases. If C is a linear set like the middle-third set 
this is not so difficult to do (we always get just another copy of C!) but, as we shall see, with nonlinearity 
the behavior of this "scenery process" gets much more interesting. 

A different way to describe the small-scale structure of C is by the scaling function, introduced by Feigen- 
baum for a specific class of examples, and studied by D. Sullivan in the present setting of C 1+7 hyperbolic 
Cantor sets. 

A third approach is to define a flow, the continuous dynamics of which reflect the geometrical notion of 
zooming continuously down toward a point. Ergodicity of the flow implies analogues of the Lebesgue density 
theorem, proved in [BF 1] for Brownian zero sets and hyperbolic C 1+7 Cantor sets; the number one gets 
(order-two density) is a conformal invariant and provides a measure of the lacunarity of the fractal; compare 
[Mand] . This scenery flow is constructed for hyperbolic Cantor sets in [BF 2] , [BF 3] , for hyperbolic Julia 
sets in [BFU] , and for limit sets of geometrically finite Fuchsian and Kleinian groups in [Fi 2] . The associated 
translation (rather than dilation) scenery flows are studied for hyperbolic C 1+7 Cantor sets and the Fuchsian 
limit sets in [Fi 2] and [Bu-F], where theorems like those of [Fi 1] are proved (an order- two ergodic theorem, 
and infinite-measure unique ergodicity). 

In the present paper we will take a viewpoint close to that of Sullivan in [Su 1] , constructing the scenery 
process in a similar way to the scaling function. It is also possible to work in the other direction; in the later 
papers [BF 2] , [BF 3] we show how to derive the scaling function from the scenery flow, and conversely how 
to construct the scenery flow from the scaling function. From this point of view the scenery process will be 
seen as an intermediate object, serving to connect the scenery flow with the scaling function. 

Sullivan's main motivation in [Su 1] was to begin to develop a Teichmuller theory for Cantor sets, for 
use in a new, more "conceptual" proof of the Feigenbaum-Coullet-Tresser conjectures (see [Su 2] and [deM- 
vS]). (Lanford's proof [L] uses (rigorous) computer-assisted estimates). A first step would be to classify 
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differentiable structures on the attracting Cantor sets of folding maps. This classification should be in terms 
of some invariant which would serve as a "modulus" of the structure; the next step would be to put a complex 
structure on the set of moduli as in classical Teichmuller theory, see [Su 3]. Now for the particular case of 
the folding map which is the Feigenbaum-Coullet-Tresser renormalization fixed point, the Cantor set not 
only has a folding dynamics but also a hyperbolic C 1+a dynamics. (This observation is attributed by Rand 
[Ra] to Misiurewicz) . One is thus led to the following separate question, which is the subject of §§1-3 of [Su 
1]: for general hyperbolic C 1+a Cantor sets, can one classify differentiable structures? Sullivan shows this 
can be done, with the "modulus" being a bounded Holder scaling function. 

Our own main focus is somewhat different. We want to describe the exact geometry, at small scales, of 
the Cantor sets, whereas to the differentiable structure, all smoothly equivalent Cantor sets will look the 
same. However for this purpose also, the scaling function contains precisely the information one needs. 

Our main theorems (Theorems 5.4, 7.4 and 7.5) concern respectively the scenery process, the smoothness 
of limit sets, and C k+1 rigidity. We summarize the totality of the resulting picture. Given one C 1+7 hyperbolic 
Cantor set, consider the collection of all Cantor sets which are C 1+7 conjugate to it. Within this collection 
is a distinguished subcollection, its limit sets. The free semigroup on two generators acts on the conjugacy 
class (by rescaling subsets of the next level); the limit sets are an attractor for this action, and the scenery 
process can be described as what one sees when walking out a branch of the tree of the semigroup. Limit sets 
are exactly the ratio sets (see §2) built from the associated scaling function. Within the big collection arc 
subcollections with higher degrees of smoothness. The big collection forms an infinite-dimensional Banach 
manifold, naturally identified with a factor of the C 1+7 -diffeomorphisms of the interval, after one Cantor set 
has been chosen as a base point. (The diffeomorphisms are the conjugacies to this set). The subcollections 
nest down as smoothness increases, and by rigidity these smoothness classes are conjugacy classes as well. 
Contained in the intersection of them all is the collection of limit sets, with the highest possible smoothness. 
Choosing one of them as a common base point, these subcollections are naturally identified with the C k+1 
diffeomorphisms of the interval. Each is a Banach manifold in its own topology, and is dense in a larger 
collection with respect to its topology. The free semigroup acts on each manifold. Its points are drawn 
exponentially fast in the C 1 norm toward the common attractor: the collection of limit sets, which form a 
compact subset of the Banach manifold. 

In the course of our paper we give careful proofs of several of Sullivan's theorems ([Su 1] is extremely 
sketchy). In some cases our different point of view leads us to different arguments from those indicated in 
[Su 1]. We will describe our approach and results more fully after a further explanation of Sullivan's ideas. 

Sullivan's differentiable structures. 

We begin with an ordered topological Cantor set, i.e. a space which is homcomorphic and order isomorphic 
to the usual middle-third Cantor set. For convenience we use J2 + = njf^O, 1}, together with the product 
topology, and with the lexicographic order. Charts arc defined to be order-preserving homeomorphisms 
into R; two charts £, £ are C k+1 compatible if £ o extends, with that degree of smoothness, to a 
diffeomorphism defined on neighborhoods of the embedded sets. A linear C(k, a) differentiable structure 
on S + will be a maximal atlas (a maximal compatible collection of charts). Here, following [Su 1], C(k,a) 
denotes all maps which are C k+1 for some 7 £ (0,1]. Therefore, a C(k,a) linear differentiable structure 
determines and is determined by a class of Cantor sets embedded in the real line, equivalent by C(k, a) 
changes of coordinates. 

For simplicity, we are restricting our attention to charts which are order preserving and globally defined. 
We mention that the word "linear" is being used in two ways: when dynamics is introduced on these sets, it 
will usually be nonlinear; the differentiable stuctures are called linear because they come from embeddings 
in the line. (Alternative theories might have charts mapping £+ to a product of Cantor sets, or to a subset 
of some fractal curve!) 

Via the homeomorphism from S + , an embedded set C comes equipped with the dynamics of the shift 
map a on S + . The set also inherits from E + a nested hierarchy of intervals, corresponding to finite words 
(cylinder sets) in S + . Sullivan uses the shift map to define C(l,a) hyperbolic Cantor sets (see §1 below) 
and the nested intervals to define the ratio geometry of a Cantor set. This assigns to each interval the 
triple (l,g,r) of length ratios of the left subinterval, middle gap and right subinterval respectively. The 
hypothesis that an embedded Cantor set C is hyperbolic C(l,a) is enough to show that a limiting ratio 



RATIO GEOMETRY, RIGIDITY AND THE SCENERY PROCESS 



3 



geometry, recorded by the scaling function, exists. The ratio geometry is bounded away from and 1, a 
condition called bounded geometry; hence the limiting values (a, b, c) are also in the interior of the simplex 
A = {(a, b, c) : a + b + c= 1}. Convergence to the scaling function is taken along inverse branches of a, which 
are indexed by points of an abstract topological Cantor set called the dual Cantor set. Thus the scaling 
function maps the dual Cantor set to A, onto a compact subset of its interior. Convergence is exponentially 
fast, and the scaling function is Holder continuous. 

Locations in the Cantor set correspond to forward images under a, since the digits of S + tell whether the 
orbit of a point lies in the left or right third of C. To study smoothness of a conjugacy or an expanding map, 
one expects of course to use the locations to estimate difference quotients. However since convergence to 
the scaling function is taken along inverse branches of a, as the scale gets smaller and smaller, the locations 
jump all over the set. 

The first remarkable result from §§1 — 3 of [Su 1] is that while indeed one cannot compute the derivative 
of the shift map from the scaling function, nevertheless this function contains complete information about 
C(l,a) differentiable structures. 

More precisely, one has the following. As we have already mentioned, (1) a hyperbolic C(l, a) Cantor set 
has a bounded Holder scaling function. Next (2) this depends only on the differentiable structure i.e. it is 
the same for C(l,a) conjugate Cantor sets. Conversely (3) an embedded Cantor set which has a bounded, 
Holder scaling function is in fact C(l,a) hyperbolic. Finally, (4) in this case the C(l,a) differentiable 
structure is determined by the scaling function. In other words two hyperbolic C(l,a) Cantor sets with 
the same scaling function arc C(l, a) conjugate. In summary, the bounded Holder scaling function gives an 
intrinsic characterization of the differentiable structure, in the sense that no embedding need be specified. 

Now furthermore, quoting [Su 1]: "... if the structure admits a C(k,a) refinement so that the shift is 
C(k,a), this structure is also determined uniquely by the same scaling function ... ". Stated as a result 
about representatives instead of the entire equivalence class, this can be interpreted as a rigidity theorem: 
if two C(k, a) hyperbolic Cantor sets are conjugate by a map which is C(l, a), then that map (possibly with 
a different extension to the gaps) is in fact already C(k, a). 

Summary of results. 

We include in this paper careful statements and proofs in particular of (1), (2), (4) above, and of C(k,a) 
rigidity. (We mention that our use of the term "rigidity" is different from that in §5 of [Su 1]). Since we are 
interested in the geometry of representatives rather than the equivalence class, all these results are stated 
in terms of the conjugacy of embedded sets rather than the classification of differentiable structures. As we 
said above, the reason for this emphasis is that our primary goal is to study the scenery process, and all the 
sets in the scenery process are the same up to conjugacy. 

A small technical difference to [Su 1] is that we use C fc+7 rather than C(k,j) throughout. We do this 
because it gives sharper statements. Thus e.g. for rigidity we show that C 1 conjugacy implies C k+1 conjugacy. 

In part because of our change in focus, we give a different proof from that suggested in [Su 1] of (4). 
Each approach has its own advantages. Sullivan's method, a direct estimate of the derivative by difference 
quotients using sums of gap lengths, gives a unified way of proving (3) as well as (4). However one also 
needs to cite an extension lemma, which is not included in [Su 1]. On the other hand our approach gives a 
unified treatment of C 1 conjugacy and rigidity, and avoids calling on the separate extension lemma. We do 
not prove (3) here, but will give a full proof (along the lines of [Su 1]) elsewhere. 

Our proof of rigidity is intimately connected to the study of the scenery process. We proceed as follows. 

First we use limiting conjugacies to construct a set- valued analogue of the scaling function. This function, 
y C y , is defined for y in the dual Cantor set, is Holder continuous with respect to a metric derived from 
the corresponding Hausdorff measures and has as its range a compact subset of the collection of all subsets 
of [0, 1] in that measure metric, and also in the Hausdorff metric on sets. The scaling dynamics enters by 
interpreting the dual Cantor set as the past of the natural extension of the expanding map on C; the scenery 
process C„ jX is then asymptotically given by evaluating the shift on any extension x = (y, x) of x G C. Since 
the limit sets were constructed by conjugacies, one can apply a lemma from the appendix of [Su 1] to help 
determine their degree of smoothness: we show they have the highest degree of smoothness [C k+1 for some 
k > 1, C°° or C w ) which occurs in the C 1+7 conjugacy class of C . 

Next, the proof of rigidity follows as a corollary. Given two hyperbolic C k+1 Cantor sets, if they are C 1 
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conjugate they have the same scaling function. Hence they have the same limit sets, which are ratio Cantor 
sets constructed from this function. Choosing one of these to act as an intermediary, the composition should 
also be C k+1 . However the maps may be defined differently on the gaps, which would lead back to the 
extension problem mentioned before. But now one has a simpler solution: a choice is made on the middle 
third, and the rest of the definition follows automatically from the dynamics. This completes the proof of 
rigidity. In summary: if two C fc+7 , C°° or C u hyperbolic Cantor sets are C 1 conjugate, then this conjugacy 
(with a different extension) is already C fe+7 , C°° or respectively. 

This leads, then, to the overall picture which is summarized at the end of the first part of the Introduction. 

Now one knows, from the rigidity theorem, that the maximum degree of smoothness occuring in the C 1+7 
conjugacy class should be encoded somehow in the scaling function. Work of Tangerman and Przytycki gives 
one way of recovering that information [T-P]. A. Pinto and D. Rand ([P-R 2], §5 and personal communication) 
and Dennis Sullivan (personal communication) have suggested other approaches, in a related situation. It 
would be nice to understand in a unified way these different points of view. 

The rigidity theorem is also stated by Tangerman and Przytycki; it is proved as a corollary of their main 
result. Their approach is quite different from ours and in particular does make use of Whitney's Extension 
Theorem. (We became aware of their preprint after the first version of this paper - an IHES preprint, July 
1992 - was completed). 

Rand introduces the notion of a Markov family to help study the relationship between scaling functions 
and smooth conjugacy in situations where one has a sequence of expanding maps, rather than a single map. 
The examples studied in [Ra] , [P], and [P-R 1, 2] include certain circle diffeomorphisms and folding maps. 
See [AF] for some related developments. 

Interesting work on the small-scale geometry of certain fractal sets, in quite different settings, has been 
done by Hillcl Furstcnberg and Tan Lei. Tan Lei in [T] proves the beautiful theorem that certain nonhy- 
perbolic Julia sets, corresponding to Misiurewicz points in the boundary of the Mandelbrot set dM, are 
asymptotically the same as dM at that point. These points form a countable dense subset of dM , yet 
the general case is still far from completely understood. This type of asymptotic limit, as well as what we 
have called here limit sets, provide examples of Furstenberg's general notion of the microsets of a subset of 
Euclidean space (lectures and personal communication). These are by definition all the limiting sets given 
by rescaling nested subsets by a sequence of affine expansions. Furstenberg applies this in a continuation of 
the analysis begun in [Fu] for determining the Hausdorff dimension of certain sets: intersections of generic 
translates of linear Cantor sets, and intersections of linear Cantor sets in the plane with foliations of straight 
lines at a generic slope. His study of these matters is related to the "times 2 times 3" circle of problems in 
Ergodic Theory. An interesting and important area of research is to develop similar results in a nonlinear 
setting, e.g. for general smooth foliations or for nonlinear Cantor sets. 

Acknowledgements. We wish to thank our colleagues and friends for conversations, encouragement and 
inspiration regarding this and related projects. We give special thanks to M. Urbanski, B. Mandelbrot, 
S. Kakutani, and D. Sullivan. The second-named author would also like to thank the Dynamical Systems 
seminar at Memphis State University for their encouragement to give a series of lectures on these topics 
(Spring 1990), and Yale University, MSRI, IHES, CUNY, the CNRS, Universite Paris-Nord, and SUNY at 
Stony Brook for their support while the paper was being written. 

§1 Two ways of building Cantor sets. 

§1.1 Hyperbolic Cantor sets, Hausdorff and Gibbs measures. We start with the usual middle-third 
Cantor set. Let S denote the 2-1 map on the middle-third set C defined by i ^ 3x(modl). The Hausdorff 
dimension of C is d = log 2/ log 3; writing H d for d-dimensional Hausdorff measure and /x for the restriction 
H = H d \c 1 we recall that /x is a Borel probability measure (total mass =1) which is invariant under S. The 

oo 

triple (C, S, xx) is canonically isomorphic to the one-sided Bernoulli left shift a on S + = Yl{Q> l}i w itb infinite 

o 

(\,\) coin-tossing measure; the correspondence is given by 7r : {xq>x\ . . . ) ^ x where x £ C has ternary 
expansion 

oo 

x = ^2x,3~\ 
»=i 
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A hyperbolic C 1+7 Cantor set C by definition also has an expanding dynamics S : C — ► C, but now 
instead of having straight lines as for 3a;(modl), the graph of S may be nonlinear: 




[Figure 1] 



To construct such a set, one defines C as a limit from two contraction mappings ifo,ifi : 7 — ► 7. We 
consider first the case where these maps are orientation-preserving, and are strict contractions in the sense 
that the derivatives satisfy < a < Dtpi < (3 < 1. We also require that 



= po(0) < M 1 ) < (0) < pi(l) = 1. 



This implies that the intervals 7 = tpo(I),Ii = ifi(I) are disjoint. We assume that ip , ipi are C 1+1 maps 
for some 7 £ (0, 1]. Here C fc + 7 means the fc th derivative D k ipi is Holder continuous with exponent 7; note that 
C 1+1 means Dtpi is Lipschitz, so C 2 implies C 1+1 (by compactness) but not conversely. (Exponent 7 > 1 is 
excluded because in that case, since the domain 7 is connected, ipi is identically constant hence immediately 
of order C°° - while the whole purpose of Holder conditions is to have intermediate grades of smoothness). 
Our convention for Holder continuity will be: if we are given that / satisfies \f(x) — f(y)\ < c \x — y| 7 , then 
we say / is Holder continuous with Holder constant c and Holder exponent 7. We remark that 
Sullivan in [Su 1] works instead with C(k,a) maps; these are defined to be the collection of C k+1 maps for 
all 7 e (0,1]. 

We define S : Iq U I\ — > I to be the map with inverse branches <po,<Pi- Note that since Dipi are bounded 
away from and 00, it follows that S is C 1+1 with same Holder exponent, but with different Holder constant. 
Inductively, form 

Ix ...x n = Pxaifx! ■ ■ ■ {<fix n {I))) 
(union over all choices, with n fixed) is the n th level approximation to the 

00 

c = nu 7 *o...*n- 

n=0 

The restriction of the map S to C maps C to itself and is (just as for the middle-third set) conjugate to the 
Bernoulli shift (S + , a), via the map it : (xqXi . . . ) 1— ► x, where x is the unique clement of (~Y%L I X0 ... Xn . 

A set C together with map S : Io U I\ — > /, defined in this way from strict contractions fo,fi, wih be 
called a strictly hyperbolic C 1+7 Cantor set (with map). 

Sometimes we are only interested in the dynamics on C itself. Knowledge of this restricted map Sic is 
equivalent to knowing how C is coded by S + . We will refer to a set together with this labeling as a marked 
Cantor set. When we forget about this coding, C will be referred to as the underlying Cantor set of (C, S). 

More generally, a hyperbolic C 1+7 Cantor set is defined as follows. Again we assume that ipo,ipi : 7 — > 7 
are order-preserving C 1+7 diffeomorphisms such that (po(0) = 0,<^ 1 (1) = 1, and ipo(l) < <£i(0). We also 
assume as before that there exists a with < a < D(fi < 1. However now the upper bound is replaced by 
one of the two equivalent conditions which follow. We write <p xa ... Xn = ip xo o • • • o ip Xn . 



where x k G {0, 1}; [j I X( 
Cantor set, C, defined as 
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Proposition 1.1.1. The following are equivalent: 

(i) 30 < 1 and N > such that for all n > N, for any x . . . x n , Dip X0 ,,, Xn < n 
(ii) 3c > and < 1 such that for all n, for any x ■ ■ ■ x n , D(p Xo ,,, Xn < c0 n . 

Proof. To pass from (ii) to (i), any will work which satisfies < < 1. For the converse, take c = 



Two C 1+7 hyperbolic Cantor sets C, C with maps S, S will be said to be continuously conjugate, 
or C°— conjugate, if there is an order-preserving homeomorphism <& : I — > I such that for all x G Jo U I\, 
S o $(a;) = $ o S(x). We say C, C are C fc+7 , C°°, C" conjugate if $ and its inverse have that degree 
of smoothness. Note that from the definition, for each n, <&(J Xo ...x„) = Ix —x n - Therefore the conjugacy 
induces the identity map on the corresponding shift spaces. 

When $ is defined (as above) on all of I we will also call it a full conjugacy. A restricted conjugacy is 
a conjugacy between the Cantor sets which can be extended to a full conjugacy. 

Lemma 1.1.2. Let (C,S) be a hyperbolic Cantor set, and assume $ : I — > I is a C 1+7 diffeomorphism. 
Define sets C = $(C) 7 U = for i = 0,1, and define the map S : I U h — > I by S = $ o S o Then 

(C, S) is also a hyperbolic C 1+7 Cantor set. 

Proof. This is immediate. Note that using condition (i), stays the same but the constant c may change. 
Similarly, using (ii), N may change while remains the same. □ 

Thus in particular the C 1+7 conjugate of a strictly hyperbolic set is still hyperbolic, though strictness may 
be lost. We remark without proof that a converse holds: by a well-known theorem due in its original form 
to Mather, any hyperbolic Cantor set is conjugate to a strictly hyperbolic set (without changing the order 
of differentiability) . Therefore if one is studying properties invariant with respect to the equivalence relation 
given by conjugacy, one might as well begin with the assumption that c = 1 in (ii); this situation occurs 
often in the dynamical systems literature. The new metric on / is referred to as an adapted metric for the 
hyperbolic map S. However for our purposes it will be important to use the original metric; otherwise the 
notion of the scenery process will lose its meaning. This will become clear in §5. 

We recall from the theory of Bowen and Ruche ([Bo 1,2], [Ru]; see also [Be]) that the dimension d of a 
hyperbolic C 1+7 Cantor set C is strictly between and 1 and that Hausdorff (or conformal) measure 
yu = H d \c has a unique normalized invariant version v, called the Gibbs measure (or Gibbs state). For 
the middle-third set /i — v, in general they arc boundcdly equivalent (i.e. the Radon- Nikodym derivative is 
bounded away from and oo); v is defined so as to be a probability measure while /i may have total mass 
7^ 1. As in [BF 1], we will need to use both measures. 

We remark that all the results in this paper generalize with minor notational changes to the following 
situation: the maps ipi are also allowed to be orientation-reversing; there may be more than two maps, 
tpi,.. .,tpk, and the Cantor set is constructed by selecting the maps with respect to some subshift of finite 
type XU on k symbols instead of the full two-shift S. 

§1.2 Ratio Cantor sets. As before, S + denotes ng°{0, 1} and now we define: S" = ni^{0, 1}, S = 
n^° oo {0, 1}. We will write y = (. . . y_2j/-i) for y G x = {xqX\ . . .) G S + and £ = (y, x) = (. . . X-2X-\.xqX\ ■ ■ ■) = 
(. . . y-2y~i-XQX\ . . .) for a point in E. We will let a denote both the (full) left shift on £ and the left shift 
(with truncation) on E + . S + is known as the future of S, and YT as its past. 

Write A for the unit simplex in R 3 and intA for its interior. Let R be a continuous function from E _ to 
intA, and write the components R = (Ri,R g , R r ). These letters will stand for left, gap and right respectively; 
by definition they add to 1 and each is strictly positive. For x = (y, x) G S we will also think of R as a 
function on S, by defining R(x) = R(y). 

Given the function R, we will define for each y G E~ the ratio Cantor set C v C [0, 1] so as to satisfy the 
following: at each stage, subintervals will have length ratios R(cr n (x). Thus, we first define Jq = [0,Ri(y)], 
If = [1 — R r (y), 1]. The left interval Iq has subintervals Iq , Iqi which are defined to have lengths in the 
ratios 



max{ (Dip. 



'XQ-.-XN 



)-0~ N } 



and = 0. □ 
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and left and right cndpoints the same as those of Iq, respectively. Inductively, for x E S + and x = (y,x), 
I Xo ... Xn+1 i s a subinterval of I% .„ Xn with length ratio 

\-Lxo~-Xn I 

Here * = I, i.e. this is the left subinterval, if x n +i = 0, and * = r, i.e. the right subinterval, if x n +\ = 1. 

Note that the fact that R depends only on the past coordinates y E £~ of x is what makes this well-defined, 
since therefore the ratio is the same for each other point in that subinterval. 

Now finally we form the set C v as before, defining 



c=nii2o- 



n=0 

The simplest example is again the middle-third set: taking R(y) = (|, |, |) for all y E E~, C v is the 
middle-third set C for each y. 

Since R by assumption is a continous map from a compact set into the interior of A, hence strictly into 
the interior, these intervals I XQXl ... nest down to a single point in [0, 1]. Hence each C v inherits from S + the 
dynamics of the shift map a. (We will also write a for this map on C v ). 

For R assumed to be Holder continuous with some exponent a > (which will be the case in the present 
paper), it turns out that each ratio Cantor set C v is also a hyperbolic C 1+7 Cantor set. (What needs to be 
shown is that a is C 1+7 on C v for some 7 with < 7 < 1, which follows by a bounded distortion argument, 
and that it can be extended to a map S on Iq U If without losing any smoothness and which is hyperbolic. 
This can be proved from a lemma of Sullivan, (3) in the Introduction above. For a full proof see [BF 3]). 
Therefore it also has a Gibbs measure equivalent to Hausdorff measure. The dimension of C v and the Gibbs 
state, viewed as a measure on S + , are the same for each y E E~ for the following reason: for y, w E £~, C y 
and C w have the same scaling function and hence are C 1+7 conjugate. (See §7). 

§2 Statement of the problem; Bounded distortion. 

Let C be a hyperbolic Cantor set. We wish to describe the geometry of the sequence of nested Cantor 
sets one sees along the way when zooming down toward a point. 

Notation will be as follows: for x E C with x = tt(x xi . . . ), let C n , x be the set CC\I Xa ... Xn affinely rescaled 
to the unit interval, so as to have endpoints at and 1. 

Write jU„ iX for the corresponding Hausdorff measure, the restriction of H d to the set C n , x - Thus we want 
to see how the sequence of sets, and of the corresponding measures, varies as n — ► 00. 

The first obstacle we encounter is that you don't get from the interval I Xo ...x n to its subinterval I Xo ...x n+1 
by one application of ipo (or ipi). Instead you have 

Ix ...x n+1 = <Px ■ ■■<Px n+1 { 1 ) = Vx ■ ■ ■ <Px n+1 (.tPx* ■ ■■<Pxli I x ...x n )), 

and since the maps don't commute, you have to go all the way back up and down the tree again, with 
more nonlinearity introduced each time. To control this nonlinearity we will use the well-known Bounded 
Distortion Property, in the following variation. For a proof see [Sh-Su], [Mane] or Lemma 6.4 below. We 
learned this version of bounded distortion from M. Urbanski. 

Theorem 2.1 (Classical). With S as above, 3K > such that for all n, for any 5 > 0, if J is an interval 
such that S m \j is 1-1 and the image S m (J) has diameter less that S, then for all x,y E J, 

-Ksn < I DS m x K s-> 
<l DS m y l • 

We mention that one sees from the proof that if c is the Holder constant for log \DS\, then the constant 
K is given by K = c/3 7 /(l - /3 7 ). 

As a consequence of this theorem, since < a < D(fi < [3 < 1 implies that a n < \I Wo ... Wn \ < (3 n for any 
w, we have: 
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Corollary 2.2. For any m,n>0 and any w G Y\{0, 1} one has for all x, y G 7, 



o 

-K(i^ \DS m {x)\ Kr -, 
< \DS^ii))\ ' 

The set Ck, x belongs to the collection of 2 k Cantor sets at level k in the tree (rescaled). We want to 
understand the geometry of the sets in this collection. 

A first approximation is the original set itself (at level 0). But by bounded distortion, for n large the 2 n 
sets at level n provide much better models for the 2 m+n sets at level k = m + n; moreover (and this is the 
strength of bounded distortion), this is true for all m simultaneously. The reason is that since by definition 

and since I Xo ...x„ has small diameter, by the Corollary the derivative of S m is close to constant - hence S m 
is close to linear. 

In summary, consider all the Cantor subsets which have this same image under 5"™ to be grouped in one 
equivalence class. The 2 k sets at level k are split into 2™ equivalence classes, each with 2 m members which 
all have approximately the same geometry, (but whose locations are scattered throughout the space!). As 
we scale down toward a point x, we are seeing sets given by these approximations. 

Note that the equivalence class of a given interval at level k depends on the immediately previous n 
branches, rather than on its initial branching structure. We will see in the next section how Sullivan uses 
this observation to study the asymptotics, associating to the Cantor set a function R like that used to define 
the ratio Cantor sets in the previous section. Then, in §5, we will show that the sequence of sets one sees in 
C is asymptotically the same as that for the ratio Cantor sets C y . 

And now for the set C v , the nested sequence of subsets has an exact description. Each subset is itself 
a ratio Cantor set. Moreover the sequence changes in the following way. Writing as above C y x for the set 
C y n I y Xn affinely rescaled to [0, 1], one has immediately from the definitions that C y x — C" 7 "^ for all 
n > 0. With the Gibbs measure on the full shift, this gives a stationary, set- valued process - which in forward 
time describes exactly what one sees as one zooms down toward Hausdorff-almost every point in the ratio 
Cantor set C v . 

§3 Sullivan's Scaling Function. Now we return to the study of a hyperbolic Cantor set C. Instead of 
treating the structure of the entire set I Xo ... Xn H C, which is what we have been emphasising so far, Sullivan 
focuses on the information contained in the first step of its construction, given by the relative lengths of the 
subintervals of I Xo ... Xn . These subintervals are the left third I Xo ... Xn o, right third I Xo ...x„i and middle gap 
written G Xo ... Xn . We normalize the lengths of these three intervals, defining for x G C and n > 0, where 
x = irixoxx . . . ), 

Rn,x (|7e ...i E?1 o|j \G XQ ... Xn \ 1 \I Xo ... Xn l\)/\I Xo ... Xn \- 

This is called by Sullivan the ratio geometry function of C; it maps N x C to the interior of the unit 

simplex A C R 3 and determines C uniquely (one simply constructs C to have these ratios). 

„ -l 

Next we write, for y = (. . . J/-2J/-1) in C = f\ {0; 1}> 

—00 

Rn{y) = (|7y_„... y _ l0 |, \Gy_ rl ...y_ 1 \, \ly _ „ . . .y _ ± 1 | ) / | Iy _ „ . . . y _ j | . 

Following Sullivan, it is nice to think of C as a distinct Cantor set, dual to C (and called the dual Cantor 
set). Later for the dynamical interpretation we will instead view C as S~, that is, as the past coordinates 

of the full shift S = Y\{0, 1}. We will use whichever symbol (C or E~) is more appropriate in the context. 

— oo 

As in [Bo 1], for any f3 G (0,1), the /3-metric on £~ (which defines what is meant below by Holder 
continuity) is taken to be: dp(y,w) — (3 n where n is the greatest positive integer such that y_„ . . = 
. . . W-\. When £~ is thought of as C, i.e. as dual to a specific hyperbolic Cantor set C, we choose (3 to 
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be (as before) the upper bound on Dipi. We mention that if (3 is replaced by some other number (3 £ (0, 1), 
then the metrics are related by 

d / 3 = (d /3 ) 1 ° s3/log/3 , 

and the Holder exponent 7 for R in the statement of the next theorem would change to 7 • (log (3/ log (3). 
Theorem 3.1 (Sullivan). Let C be a hyperbolic C 1+7 Cantor set. For every y in the dual Cantor set C, 

R(y) = lim R n (y) 

exists. The convergence is of order 0{(3 ni ), uniformly in y, and the function R is Holder continuous with 
exponent 7, in the (3 -metric. R takes values strictly in the interior of A. 

Definition. R is called the scaling function of C. 

Proof. We will first show that for each y, R n (y) n = 1, 2, ... is a Cauchy sequence. Since 

S (^!/-(n+m) ■■■V-l) = Iy-n---V-l 

and similarly for the subintervals, applying the Mean Value Theorem and Bounded Distortion Property 
(Corollary 2.2) we have for all m > 

R n (y) = R n+m (y)e ±K ^ 

Therefore R n {y) is Cauchy sequence (i.e. each of its three coordinates is) hence it converges; call the limit 
-1 

R(y). Next, if y, w £ f\ {0, 1} agree on the coordinates — n, . . . , — 1 then since R n (y) = R(y)e ±K @ and 

—00 

Rn(y) = R n (w), we have 

R(y) = R{w)e ±2K < 3n \ 
Writing || • || for sup norm in R 3 , this implies that, with the log taken by components, 

|| logR(y) - log i?H ||< 2K(d p (y,w)r, 

i.e. logi? is Holder continuous with exponent 7; therefore so is R. □ 

Thus (since the normalized lengths add to one), R maps C onto a compact subset of the interior of the 
unit simplex in R 3 . 

§4 Dynamical versions of Sullivan's theorem. In this and the next section we return to the original 
motivating question: what does the sequence of sets C n , x look like? This is exactly what one sees for the 
n th level Cantor set, as one zooms down toward x. 

First we state Theorem 3.1 in a dynamical form. Here it will be crucial to think of the dual Cantor set C 
as the past £~ of S. We extend the function R to S by defining: R(x) = R{y) for x = (y, x). This function 
depends only on the past coordinates y of x. 

Corollary 4.1. For each x G C , for any choice of w £ S such that 3k > with u>k,Wk+i ■ ■ ■ — Xk, Xk+i ■ ■ ■ 
(where x — n(x xi ...)), then 

II R n ,x — R(o- n w) || — > as n — > 00. 
The proof is immediate from the definitions, and in fact if the Holder constant for R is c > so that 

\\R(x)-R(z) \\<cd (x,z), 

one has 



Rn, x -R(o- n (w)) ||<c/3 ! 
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here the /3-metric has been extended to E in the natural way, with points x, w having to agree on coordinates 
from — n to n. 

We note that equivalently, if x and w are in the same unstable set in (C, S) then the sequence R n ^ x (with 
any past) is in the stable set (in the shift on sequence space) of the sequence given by R sampled along the 
shift orbit of x, with an exponential rate of convergence. 

We recall that a stochastic process is simply a (one- or two-) sided sequence of measurable functions /, 
(known as random variables) defined on some probability space (Q, v). The process is stationary if a time- 
shift doesn't alter the probability of an event. Equivalently, the space of paths {(..., fi{u>), ••■)}, acted on 
by the shift tranformation and given the pushed-forward measure, is a measure-preserving transformation of 
a probability space. Conversely, a measure-preserving transformation determines many stochastic processes: 
choose a measurable function and evaluate it along orbits. Thus for example R(a n w) is a (A-valued) 
stochastic process. In the next section we will encounter set- and measure- valued versions of this. 

Next, recall the definition of a generic point x for an ergodic measure-preserving transformation T on 
a compact metric space X with probability measure to. For each continuous / : X — > M, x satisfies: 



That is, x samples each continuous function well with respect to time averages. If X is a Polish space 
(a complete separable metric space) - this will occur in the next section - , then we instead sample the 
continuous functions with compact support. By the remarks in the previous paragraph, this definition also 
makes sense for a stationary ergodic stochastic process, if the path space has been given the topology of a 
Polish space. In the definition of generic point we only take time averages toward +00, so in the case of a 
two-sided stochastic process, it will be natural to allow a one-sided sequence as a generic point, as well. 

If the measure lives in a compact part of the space (which will always be the case in this paper) then 
by an ergodic theorem of Kryloff and Bogliouboff (i.e. by the Birkhoff ergodic theorem plus compactness), 
TO-almost every a; is a generic point. 

Now \x is equivalent to the Gibbs measure v 1 which is invariant and has a unique invariant natural extension 
u on (£, a). Hence by Kryloff and Bogliouboff: 

Corollary 4.2. For [i— a.e. x G C, the (one-sided) sequence R\ tX) R2, x ■ ■ ■ is a generic point for the ergodic 
^.-valued process R(a n (w)) for nGZ, given by w&T, being distributed like v. 

§5 Conjugacies, and the scenery process. In this section we will construct a set-valued version of the 
scaling function, and use it to prove analogues of Corollaries 4.1 and 4.2, which will describe how the sequence 
of sets C niX approximates the scenery process. We will make use of three different metrics on collections of 
Cantor sets. One metric, which is derived from the C 1 norm on the space of conjugacies of Cantor sets, is well 
suited to proofs and is natural from an abstract point of view. There we will prove properties (convergence 
at at exponential rate; Holder dependence) which will then pass over to two geometrically defined metrics: 
the Hausdorff metric, and a metric derived from the Hausdorff measures. 

Three metrics. Our metrics will be defined on several different spaces. Fix a hyperbolic C 1+7 Cantor 
set with map, {C,S). We write £ 1+1 = £ 1+7 (C) for the collection of Cantor sets (with maps) which are 
C 1+7 -conjugate to (C\S). (From Lemma 1.1.2, these are also hyperbolic C 1+7 Cantor sets). We write £i +7 
for the quotient space of £ 1+7 where (C, S) and (C, S) are identified if S — S on C. This is the collection of 
marked Cantor sets conjugate to C, or equivalently the pairs (C, S\c) with restricted maps. Elf 1 will denote 
the collection of underlying Cantor sets. We write Diff 1+7 for the C 1+7 order-preserving diffcomorphisms 
of /. Given choice of the pair (C,S), Diff 1+7 projects onto £ 1+7 (C) in a natural way: / is mapped to 
(Cf, Sf) = (f(C), f o S o J -1 ). This is many-to one because there is some freedom given by the gaps; see 
Proposition 8.3. We note that the projection from £ 1+7 to £* +7 is also many-to-one. 

The scenery process can be thought of as taking values in fi +7 , the marked Cantor sets, or in the space 
of underlying sets £*^ 7 . We will first prove convergence in the space of conjugacies, Diff 1+7 ; this will then 
imply convergence in the other spaces. 




N-l 
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First we consider two metrics on 7 . We recall the definition of the Hausdorff metric on the collection 
of closed subsets of the interval I: 

d H (A, B) = inf{e : A + (-e, e) D B and B + (-e, e) D A}. 

This defines a metric on Elt 1 , and a pseudo-metric on the other spaces defined above. 

Next, we define the following metric on the set of finite Borel measures on [0, 1], denoted Ad. Enumerating 
binary intervals Ei,E 2 , . . . ,E n . . . of the form \j2~ k , (j + l)2 _fc ], for v\, v 2 in Ad, set 

oo 

dfa, V 2 ) = \MEn) - MEn)\/2~ n . 
n=l 

This metric induces a topology equivalent to the weak topology on Ad, in the language of probability theory; 
in analysis terminology this is the weak-* topology on Ad, the dual of the space of continuous functions. 
On Sit 1 we define the measure metric cIm from this, setting: 

d M (C,D)=d(H d \ c ,H d \ D ) 

where H d is d-dimensional Hausdorff measure. On the other spaces, this again defines a pseudo-metric. 
Next, recall that the C 1 -norm of / : I — ► K is: 

H/IICI = H/Hoo + H^/lloo- 

We identify Diff 1+7 with the collection of triples (C f , S f , /) for / e Diff 1+7 , to be written as £ 1+7 . (As we 
noted above, the map from £ 1+ ~< to £ 1+7 is not one-to-one). The C 1 -norm on Diff 1+7 determines a metric 
on £ 1+ "< as follows. For /, g in Diff 1+7 , we write 

d c (C f ,C g ) = \\f-g\\ cl . 

We call this the C 1 metric on £ 1+7 . 

Note. The metric dc keeps track of the map S on all of its domain Iq U 7i, while dn and c?m only see the 
Cantor sets. Thus for the C 1 metric, writing I for the identity map, (C, S,T) and (C, Sf, f) will be a positive 
distance apart unless in particular S = Sf on all of Iq U I\. (Of course also one needs / = 1). 

The definition of dc depends on the initial choice of the set (C,S) (with the identity map); the next 
proposition shows how the metric varies if we change this "base point" of £ 1+7 . As we will see in §8, Diff 1+7 
is a Lie group and this is also a statement about bounded invariance of a metric on that group. 

Proposition 5.1. Let D e £ 1+7 ; with D = $(C), with $ e Diff 1+7 . We have: 

-^d D <d c < Kd D 

where K = 2max{\\D$\\ 00 , HD^" 1 ) l^ 1 }. 

Proof. We note that from the definition of the C 1 -norm one has that if / : I — ► I with /(0) = 0, then 
WfWci < 2||D/||oo. Therefore if also g(0) = 0, then \\g o f\\ ci < 2|| 5 || C1 ||/|| C1 . 
Now for /, g and $ as in the statement of the Proposition, we have: 

d D (C f ,C g ) = ll/o^-go^Hd = ||(/-. 9 )o$- 1 || cl 
<2||/- 3 || cl || ( f>- 1 || cl =d c (C / ,C ff )||$- 1 || cl 

which gives one of the inequalities. The other is proved in the same way. □ 

Next we will look at how (on £ 1+7 ) the pseudo-metrics dn and g?m compare to the metric dc- First we 
recall how the Hausdorff measure transforms under mappings. 
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Definitions. Given a 1-1 differentiable map 9 : M — ► N between open subsets of R and given a Borel 
measure \i on M and real number d > we write: (ip*fi)(E) = nfjP^E) and 



(&li)(E) = [ \D<P\ d d^i. 

Jl'- 1 E 



Thus if - */! is the usual push forward of fi, and >P[i is the d)— conformal transform of /i. 

Hausdorff measure has the conformal transformation property with respect to C 1 maps: for <P : R — > 
K a C 1 diffcomorphism, 

H d = $( H dy 

Proposition 5.2. With dn, du and dc denoting the Hausdorff, measure, and C 1 metrics respectively, for 
allCf, C g e£ 1+ ~<, 

d H (C f ,C g )<d c (C f ,C g ) 
and for ^(x) = 5x + Ax 2 , we have for all Cj £ £ r {C), 

d M (C,C f )<*(d c (C,C f )). 

Proof. For the Hausdorff (pseudo)-metric this is immediate, using the L°° norm, since 

d H (C f ,C g )< \\f-g\\oo<d c (C f ,C g ). 
For the second inequality, writing 1 for the identity map on I, we have 

dc(C,C f ) = \\f-X\\ci 

and writing = H d \ c , fif = H d \ Cf , 

d M (C, C f ) = ]T \v E n - VfE n \2- n 

n 

= El/ \Df\ d dn-[ ld M |2- 

<2||.f-X|| 00 ||A/||iE + E / 

(here the first term bounds the contribution, for each interval E n , of its two ends not matching up exactly 
with f~ 1 E n ; we used the fact that since / is a diffcomorphism of I, ||£)/||oo > 1). 

Next, we note that for all x > 0, \x d — 1| < \x — 1|. Hence ||D/| d - l| < \\Df \ — l|, so the above is 

< 2||/ - X|U(1 + \\\Df\ -l\\J + \\ \Df\ -l^Yl \ E ^ n 

n 

< 4|||£)/| - ill (1 + II |D/| - l|| ) + |||D/|-l|| 

— II 1 ^ 1 II oo v II 1 J 1 1 1 oo 7 II 1 *' 1 Hoc 

< 4(1 + \\\Df\ lL + 1)(|||D/I - 1|L) = 5|||D/| - 11^+4(11125/1 - 1||J 2 
<*(dc(C,C f )), 

as claimed. □ 

We define the (restricted) C 1 metric on £* +7 to be 

dc*(C f ,C g ) = \\f-g\\ci,c 

where this indicates that the sup norms are taken over the Cantor set C. Obviously dc* < dc on £ 1+7 . 
The first theorem we are aiming for will state: 
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Theorem 5.3. Given a C 1+1 hyperbolic Cantor set (C, S), for every y = (. . . y_„ . . . y_\) G E , the limit 

C(y) = lim C y _ n ... y _ 1 

n — >oo 

exists. Convergence is exponentially fast, and C(y) is Holder continuous with exponent 7 (for both metrics 
on Elt 1 , and for the restricted C 1 metric on the marked Cantor sets £l +1 ). Moreover, C{y) = C y , the ratio 
Cantor set built from the scaling function of (C, S). 

Thus C(y) is a set- valued version of the scaling function R(y). To prove Theorem 5.3, we first construct 
certain conjugacies (Theorem 5.9), proving convergence in the C 1 metric. Convergence in the Hausdorff and 
measure metrics then follows from Proposition 5.2. Before giving this construction, we state several further 
consequences of Theorem 5.9. 

In the same way as for the scaling function R(y), we define C(-) also on the full shift space E, setting 
C(x) = C{y) for x = (y, x). We will have: 

Theorem 5.4. For every x £ C , the sequence C„ jX = C Xo ... Xn is asymptotic to C(a n (x)), with an exponential 
rate of convergence, for any x m E with the same future coordinates (xo, x\,. ..). 

We will write Cc for the (compact) subset of £* +7 (with respect to all three metrics) which is the range 
of the function x 1— > C(x). This is the collection of (marked) limit sets. Since the function is continuous 
and the domain E is compact, we then have: 

Proposition 5.5. The collection of limit sets is compact. 

Definition. Given a C 1+1 hyperbolic Cantor set C, the set-valued scenery process of C is the process 
C(a n (x)) = C CT "^, with ieS, distributed according to the measure v. 

Note that stationarity and ergodicity of this process follow immediately from invariance and ergodicity of 
the measure v. 

The space of paths is a compact subset of the Polish space n+^£j* (or Ilt^f* for the marked sets), with 
the product topology determined by the topologies of any of the three metrics. See the proof of Corollary 
5.6. 

We mention why we use probability terminology - the scenery process - for the map C(x) ^ C(ax). Note 
that this dynamics is not in fact given by a map on Cc itself. Indeed at every stage you have two choices 
- the right- or left-hand subsets from the next level of the ratio Cantor set C (x) , with the choice of left or 
right depending on whether xo is or 1. Or, from a different viewpoint, one has the dynamics on Cc of a 
semigroup action; see the note at the end of the paper. 

The next result is like Corollary 4.2. 

Corollary 5.6. For [i—a.e. x e C, the sequence of (rescaled) Cantor sets C n ^ x which nest to x is a generic 
point for the stationary ergodic set-valued process C a determined by (£, <r, v). 

In §2, we defined [i n ^ x to be the sequence of Hausdorff measures H d \c n x . Convergence of C„ iX to C(y) in 
the measure metric can be rephrased as follows: 

Corollary 5.7. For every y = (. . . y_„ . . . y_i) e the limit 

M{y) = lim A^-„...y-i 

n >oo 

exists, and is Holder continuous with exponent 7. 

The support of the measure M(y) is the set C(y), and M(y) is a measure- valued version of the scaling 
function. 

As we did for the set- valued process, we define M(x) — M(y). We define the measure- valued scenery 
process M(n,x) = M(a n (x)), again with x £ E, distributed like v. We have: 
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Corollary 5.8. For fi—a.e. x G C, the sequence n Xo ...x n is a generic point for the stationary ergodic measure- 
valued process M(a n (x)) determined by (Y,,a,i>). 

Now we proceed to the proofs. For each interval I Wo ... Wn with Wi = or 1, we write A Wo „, Wn for the affine 
map which expands the interval to the unit interval I. We then set for y 6 C, A v n = A y _ n ,,, y _ 1 . This expands 
the interval I y _ n ... y _ 1 to / afhnely. Next, define maps <f \ n : I — > / for k > n by: 



^Pk k = identity and 

for k > n. 



We will also write <p v k for ip\ . 

For < n < k, we define $^ : / — > / by: 



It follows that: 



(1) for each n, ®n,n = identity, 

(2) for all m > k > n, 

*m,n = ® V m,k ° ®k,n and 

(3) ^ i0 = ^o^. 

We will also write <& y n for $^ . 

The sequence $^ is, to use Sullivan's words, "a sequence of ... compositions (of contractions) ... renor- 
malized by post composition with linear maps to obtain mappings between unit intervals..." (Appendix of 
[Su 1]). As Sullivan states, and as we will prove in the next section, such a sequence is precompact in C(k, a) 
if the original hyperbolic Cantor set is C{k, a). 

This gives convergence along some subsequence $^ fc . However unfortunately, that is not enough for our 
goal of proving an ergodic theorem for Cantor sets and measures (i.e. Corollaries 5.6, 5.8) - for that purpose 
we want instead to prove that the sequence itself converges. We do this in the next theorem, using bounded 
distortion, and then in the next section we return to Sullivan's idea to prove smoothness of the resulting 
limiting conjugacy. 

Theorem 5.9. Let (C,S) be a C 1+1 -hyperbolic Cantor set. For each y in the dual Cantor set C, 

^ v = lim §1 

n — >oo 

exists. This is an order-preserving diffeomorphism from I to I . Convergence is of order 0(/3™ 7 ) in the C 1 
norm, uniformly in y, and the function y i— > 3> y is Holder continuous of order -f, in the (3-metric. 

Proof. We will show, using bounded distortion, that for n large and for k > n arbitrary, $^ n is close to the 
identity. Then since Q v k = $^ o $^ , this will imply convergence. 

Now since the maps A are affine they have constant derivative. So for each a E I, for m = k — n, we have: 

F>A V 

(D(*l n ))(a) ee D(Al o (S-T 1 o (Al)-')(a) = D ^ m{z) for z ee {Al)-\a). 

DS m (z ) 
= DS m (z) S ° mC Z ° G 

by the Mean Value Theorem. Therefore by Corollary 2.2 
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for all k > n, y € C and all a G /. This implies the sequence n = 0,1,2... is Cauchy, hence 

converges. Since $^(0) = for all n, it follows from the Fundamental Theorem of Calculus that the limit 
<& v = lim^oo exists, and that D$ y = limD^. By Corollary 2.2, D$ y is bounded away from and oo 
by e ±Kfr ' ; in particular, <& y is an order-preserving diffeomorphism from / to /, as claimed. 
We define for each n 

$ v = lim ■ 

k — >oo ' 

the limit exists by the above arguments, and this map is C 1 -close to the identity map X : I — > I. We have 
for each n that 

and that: 

(1) 3k a > such that 

\\^ n -I\\ci<k ^. 

The constant ko here only depends on K from Corollary 2.2, which in turn depends on (3, the upper bound 
for |-D<£i|. Here is the calculation: we have 

||^,„-J|| C l<P^ ;n -l||oo 

and we know that 

for all n. Now since for a; in the interval [e~ K ,e K ] one has \e x — 1 < fco^ + 1, where we take fc = 
(exp(exp(_ft')) — l)/(cxp(K)), statement (1) holds true. 

Now recall from the proof of Proposition 5.1 that for /, g : I —> I with /(0) = and g(0) = then 
||<?°/||ci < 2||g||ei||/||ci- From this, it follows that statement (1) is equivalent to: 

(2) 3fci > such that 

||#»-$ , '||ci<*i/? n7 . 

(Here we can take k\ — (e K )ko). 

Both statements express, in different ways, that <I> y is close to <f>^ , with exponentially fast convergence; 
(2) is what we stated in the Theorem. Finally it is now also easy to check Holder continuity: 3k2 > such 
that for all y,w e C, 

\\&-* w \\ci<k2(df,(y,w)y. 

□ 

Proof of Theorem 5.3. Writing C y for $£(C) = C y _ n ... y _ 1 , and C v = $ y {C), statement (2) above says 
exactly: 

d c (C y ,C y ) < 

Hence we have convergence in the C 1 metric. By Proposition 5.2 therefore, dn has the same bound. For the 
measure metric, we have 

d M (cic y )<^(d C y(c\cy)) 
= *(||#So, n -x|| cl ) 

< *(fc /?" 7 ) 

by (1), and this is < k^(3 ni where k^ = ko(5 + 4fc ). Next we show Holder continuity. Now dc{C v ,C W ) — 
— $™||ci so (3) proves Holder continuity for the C 1 and Hausdorff metrics. Then, applying Proposition 
5.1, d M (C y ,C w ) < ${d C y(C y ,C w )) < *(dc(Cf , C ,u, )||^- 1 ||ci) < *(e K fc 2 /?" 7 ) < fc 4 /3" 7 where fc 4 = 5a + 4a 2 
and a = e K ki. Finally, it is clear from the constructions that C(y) has ratio geometry given by R(y), hence 
C{y) is indeed equal to C v . □ 

Proof of Theorem 5.4- From the proof of Theorem 5.3, since the exponential bound is uniform over all sets 
of level n, we have that the dc-distance from C(a n x) to C Xo ... Xn is bounded by The bounds for dn 

and du then follow as above. □ 
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Proof of Corollary 5.6. We give the proof for the process which takes values in the collection of marked sets. 
Here the space will be 11^,5^ (C), with the shift transformation; this is a Polish space (since E\ is; we use E\ 
rather than £* +7 so as to have a complete space). Therefore we know from our definitions what it will mean 
for the one-sided sequence C n ^ x to be a generic point. Now the map x i— » (. . . C(a~ 1 x), C(x), C(ax) . . . ) from 
S has as its image a compact invariant subset of IP° 00 £c' C IFf^^^C); this image is the space of paths 
of the scenery process, and is the support of the image of the measure v. The ergodicity of (£, i>, a) passes 
over to the scenery process, hence a.e. path (. . . C(a~ 1 x), C(x), C(crx) . . . ) is generic for the shift on path 
space. Finally, since by Theorem 5.4 we know the sequence C n>x is forward asymptotic to C(a n x), we will 
compute the same time average for the continuous functions. Thus C n>x is generic, for v-a,.e. x and hence 
for ^i-a.e. x. □ 

§6. Smoothness of conjugacies. Now we will see how to prove the conjugacies of §5 in fact have higher 
smoothness properties. 

The basic idea will be to imitate what one knows about analytic maps, for C k+1 or C(k,j) maps. Thus 
Lemma 6.2 is a version of Leibnitz' formula, and Lemma 6.3 is one step in showing C k+1 maps are morphisms 
in a category. This means they can be used to define equivalence relations on sets, and to give the analogue 
of diffcrentiable structures. One also imitates the Arzela-Ascoli Theorem, in Lemma 6.4; as Sullivan says in 
the Appendix of [Su 1], and quoted in the previous section, the sequence <I>^ will be precompact in C(k, a). 
(As usual, we do our proofs instead in C k+1 ). 

Here is the main theorem we are aiming for. 

Theorem 6.1. Let C be a C k+1 hyperbolic Cantor set, and let & v : I —> I be defined as in §5. We claim: 
(i) if k = 1,2, ... , then <$> y is C k+1 (with same Holder exponent, but a different Holder constant), 
(ii) if k = oo or u, then $ v is C°° or C u respectively. 

First we need a few lemmas. 

Lemma 6.2. For A C R, if f : A — > M and g : A — > R are bounded, 7 -Holder continuous with Holder 
constants c, d, then: 

(i) f + g is "f -Holder with constant c + d, and 
(ii) f ■ g is "{-Holder with constant cWgW^ + d||/||oo- 

Proof, (i) is immediate. The argument for (ii) comes by imitating the proof of Leibnitz' rule in the Calculus: 

\f(x)g(x) - f(y)g(y)\ = \[f(x) - f(y)]g(x) + f(y)[g(x) - g(y)}\ 
< c\x - j/HMloo + d\x - ?/| 7 ||/||oo- 

□ 

Lemma 6.3. Fix k > 1. For A, B C R, let f : A -» B and g : B -» M be such that D k (f), D k (g) are 
bounded and 7 -Holder. Then D k (g o /) is "{-Holder. 

Proof. This now follows by induction, from the Chain Rule plus Lemma 6.2. □ 

The next lemma is basically the same as the "bounded variation" lemma, Lemma 1.15 from [Bo 1], except 
it is written in the reverse direction, for the contractions <pi instead of the inverse map S. For the special case 
/, = ip w . and hi — log \Dip Wj \ where i + j = n, one gets exactly the Bounded Distortion Property (Theorem 
2.1). The formulation given here is from the Appendix of [Su 1]; the key idea for proving our Theorem 6.1, 
which is also in that Appendix (the sentence immediately preceding the Corollary there), will be how to use 
this lemma to control higher order derivatives of the composition. One can summarize the idea as follows: 
do not look at log \D k (p Xi |, but instead at D fe_1 log \D(p Xi \. Then we are applying the linear operator D k to 
a sum, which leads to the proof. 

As usual, for notational simplicity we assume strict hyperbolicity. 

Lemma 6.4. Consider a composition of contractions f n o ■ ■ ■ o / 1; with fi : Ji — > for intervals Ji C R 
, and such that \Dfi\ < [3 < 1. For a point x G J, write X\ — x, iTj+i = fi(xi). Let hi : Ji — > R be Holder 
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continuous with the same exponent 7 G (0, 1] and same constant c. Then for h(x) = hi(xi) + • • • + h n {x n ), 
h is also ^-Holder continuous, with constant cq — c/3 7 /(l — /3 7 ) (independent ofn). 

Proof. Immediate from the geometric series, since for x,y <E J\ we have \xi — y,| < /3\ □ 

The next little lemma is more subtle than one might at first think. We wish to thank Z. Nitecki and 
M. Urbanski for discussions which resulted in a first proof, and Y. Kifer for then finding the much simpler 
argument given here. 

Lemma 6.5. Let /„ : / — > / be continuous functions with continuous k th derivative and assume that there 
exist functions /, g such that: 
( i) fn -> / and 

(ii) D k f n — ► g, uniformly as n — > 00. Then D k f = g. 
Proof. We define, for each < j < k, functions gj, and sequences of functions f n j and p n j by: 

g k = g and &-i(t) = / gj; 

Jo 

fn.k = D k f n and f n ,j-i(t) = / f n ,j\ 

Jo 

Pn,j — ^ ' fn fn.j- 

Thus 

Pn,k = 0, 

Pn , k ^{t) 3 = {D k -\f n )(t) - ((D k -\f n )(t) - (D k -\f n )(0)) = (D k -\f n )(0), 
Pn,fc- 2 (t) = D k - 2 f n (0) + t(D k -\f n )(0) 

and similarly, (for each n) p n j is for all j a polynomial of degree k — j — 1, such that Dp n j = p n ,j+i- Now 
for each j, lim^oc f n j = gj. In particular, 

.go = lim f nfi = lim (/„ - p n ) = f - limp„. . 

n — >oo n — >oc 

Hence lim^oc p n $ converges (uniformly), so to some polynomial powith degree at most k — 1, and we have 
go = f — Po- Therefore, 

g = D k g = D k f - D k p = D k f 

as claimed. □ 

Proof of Theorem 6.1. Since D k ip and D k ipi are 7-Holder, by Lemma 6.3 so is 
D k ~ 1 logDipi, with some Holder constant Co- Now we apply Lemma 6.4 to 

fn°---°fi=fy n °---o<P yi = <P V „ 

and 

hj=D^-^ logDLp Vj . 
For x £ I, writing x\ = x, x 2 = h\{x) etcetera as in Lemma 6.4, since 

n 

we conclude that Z)( fc_1 ) log Dip n is 7- Holder, with some different constant C\ which is however independent 
of n. 
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Now to prove the Theorem, first consider the case k > 2. Here we have D {k -^ \ogD<$>y n = D^ k ~^ log D (pi, 
since the constant derivative of Al disappears upon higher differentiation. For k = 1 these are not equal; 
they differ by the constant log DAI (which increases with n) . However these cancel upon subtraction, so in 
either case we have, for any a,b G I, 

log D<S>Z (a) - log D$l (b) | = | log Dpi (a) - log D(p y n (b) \ . 

Therefore for all k > 1, D k ~ x log D&l is a sequence of bounded functions which is 7- Holder with the same 
constant, c\. Also, this sequence is uniformly bounded. For k = 1 this follows from bounded distortion, 
as in the proof of Theorem 5.4, and in fact a bound is e K ^' . For k > 1, we argue as follows: if it were 
unbounded, then by Holder continuity with the same constant, some subsequence goes uniformly to either 
+00 or —00. By integration (k — 1) times, by induction this contradicts the boundedness for k = 1. This 
implies equicontinuity. Now by boundedness and cquicontinuity, there is some convergent subsequence, using 
the standard diagonalization argument as in the proof of the Arzela-Ascoli Theorem. At the same time, from 
§5 we know that lim D$l = D<& y exists which implies log D$>1 converges to log D<& y . Calling the subsequence 

log £><!>». =/,-, 

we are in the situation of Lemma 6.5: fj — > /, D k ~ 1 fj — > g hence D k ~ 1 f = g. Thus D k ~ x logD<§> y is a 
uniform limit of 7- Holder functions with the same constant c a , hence the limit is 7- Holder with constant c a . 

From Lemma 6.3, D k $ v is also 7-Holdcr and we are done for k = 1, 2, 

Finally note that for k = 00 we are done by part (i), and for k = w we can apply Arzela-Ascoli to see that 
{&l} is a normal family, hence the limit <f> v is also analytic. □ 

Remark:. We emphasize again the subtle point in the logic of this argument: C 1+7 convergence of $1 to 
is not known. What we do know is convergence in the C 1 norm (from Theorem 5.9) and convergence 
along a subsequence in the C 1+7 norm, as just shown. This is enough to prove the claim of the Theorem. 

§7. Smoothness of limit sets, and rigidity. 

Given two C 1+7 hyperbolic Cantor sets (C, S) and (C, S), recall that the (full) conjugacy $ is an order- 
preserving map defined on all of /. This map is uniquely determined on C by the conjugacy equation, since, 
as one sees, the symbolic dynamics is preserved. Note that for any two topological Cantor sets, once they 
have been coded by the two-shift S + in an order-preserving way, this conjugacy on the Cantor sets extends 
to a homeomorphism on /. The issue therefore is what types of conjugacies preserve what type of structure. 
As is well known and not hard to show, for instance, a biLipschitz $ will preserve the Hausdorff dimension. 
We noted in [BF 1] that C 1 maps preserve the order-two density. Furthermore for C 1 conjugacy from [Su 1] 
one has: 

Lemma 7.1. If two C 1+7 hyperbolic Cantor sets are C 1 conjugate, then they have the same scaling function. 

Proof. By uniform continuity of the derivatives, since we already know the scaling functions exist from 
Theorem 3.1, this is immediate. □ 

Hence under the same assumption, by Corollary 5.3 we have: 

Corollary 7.2. They have the same collection of limit sets. □ 

To prove our rigidity theorem, we will need the following. 

Lemma 7.3. Let (C, S) be a hyperbolic C k+1 Cantor set. Let S : Io U I\ — > / be a C k+1 map such that 
S = S on C. Then S and S are C k+1 conjugate. 

Proof. The conjugacy is the identity map on C; what we want to do is define it on the gaps. We begin 
by defining $ to be the identity also on the gap G between Iq and I\. The conjugacy is then uniquely 
defined from the conjugacy equation, by the dynamics. That is, writing G Xo ... Xn = (p Xa ...x n {G), we have for 
a £ G Xo ,,, Xn , 

$(a) = <?x ...x n (<Pxl.. Xn {a)) = $ X0 ... Xn (S n (a)). 
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One immediately checks that with this definition, $ is a conjugacy. 

This map is C k+1 on the interiors of all the gaps. At points in C, to check C k+1 one must be careful 
because these points are also limits of interior points in the gaps. 

Here is one way of proving $ is everywhere C fc+7 . 

Define a sequence of maps : / — > / by: $ = the identity, $1 = $ on G and (p Xo o S everywhere else 
(i.e. on I U/i), and inductively, set <!>„ to be equal to everywhere except on \JI Xo —x n , where it is 

defined to be <p Xo ... Xn ° S n . 

These maps converge uniformly to <&. So if we can show that for each n, D k <& n is 7-Holder with a constant 
independent of n, this will carry over to the limit and we will be done. (Here we will use the fact that the 
maps (p Xo ...x n S n are 7- Holder with a fixed constant). 

The advantage of this method is that we must only check smoothness at each stage, and so each time at 
only finitely many points. 

Now consider the map / = (p Xo ... Xn o S n on C D I Xa ... Xn . It is the identity there, and since C is dense in 
itself, Df = 1 on that set. Since it is twice diffcrcntiable, D 2 f = there and similarly for D k f. Therefore 
when we define Q n by gluing together (Pxo,...,x n -\ 

o S n and (p Xo ... Xn o S n at an endpoint p, the two functions 
agree at p for all derivatives < k. Also D k is 7-Holder, for each piece. Hence for all n, $ n is C k+1 with a 
fixed Holder constant, as we wanted to show; so we are done. □ 

This produces one conjugacy. In §8 we will return to this proof, in order to study how many such maps 
$ there are. 

We are now ready to prove: 

Theorem 7.4 (highest smoothness). Given a C 1+7 hyperbolic Cantor set C, its limit sets have the highest 
degree of smoothness of any hyperbolic C 1+7 Cantor set in the C 1+7 - conjugacy class of C. 

Proof. Let (C, S) be a C fc+7 , C°° or C w hyperbolic Cantor set which is C 1 conjugate to C. By Corollary 
7.2, C and C have the same limit sets. And by Theorem 6.1, the map $ a : I — > I defines a dynamics 
S v : Iq U I\ — > I by conjugation with the map 5, which has that same degree of smoothness. □ 

Theorem 7.5 (rigidity). If (C, S) and (C, S) are two C k+1 , C°° or C u hyperbolic Cantor sets which either 
(a) are C 1 conjugate by a map $, or (b) have the same scaling function R, then they are in fact conjugate 
by a map $:/—>/ which for (a) agrees with $ on C or for (b) agrees with the coding; this map is C k+1 , 
C°° or respectively. 

Proof. By either hypothesis they have the same limit sets. Choose one, C v . Again by Theorem 6.1, the 
maps $ y have the same smoothness as S, S. Now let S y , S v denote the maps defined on Iq U If by 
these conjugacies. We are exactly in the situation of Lemma 7.3, and have a conjugacy $ of S v and S v . 
Composing the three maps 

o $ o $v 

finishes the proof. □ 

§8 Banach space structure. 

Fix a hyperbolic C 1+7 Cantor set (C, 5). For r = k + 7 where 7S (0,1], k > 1 or for r — 00, lo we write 
£ r = £ r (C) for the collection of Cantor sets (with maps) which are C r - conjugate to (C, S). (From Lemma 
1.1.2, these are also hyperbolic C 1+7 Cantor sets). The spaces DifF, £ r , £1 and are defined as they were 
in §4 for the case r = 1 + 7. 

In this section we will see how £ r can be viewed as a Banach manifold, in fact a Banach Lie group. We 
will also define a natural topology on £ r , and show that £ r factors nicely over £ r as a topological space. 

We define first the C 7 norm on the Holder functions C 7 (/,R) to be 

llfll llfll cur, Z IhPl 
\\f\\Cj = H/lloo + SUp — - — — . 

x.yel \ x y\ 

For r = k + 7 where 7 £ (0, 1], k > 1, the C r norm will be 

^\\D l f\\ oa + \\D k f\\ Cl . 
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For C°° we define 

||/||coo=SU P {||£) , /||oo}, 
I 

and for C w we will use the sup norm (since it is equivalent to all the other C r norms there). 

By definition a Banach manifold is a manifold which is locally modelled on a Banach space, and a Lie 
group is a group, which is also a C°° manifold modelled on a complete, locally convex vector space (see e.g. 
[Mi]). Recall that DifF denotes the C r order-preserving diffeomorphisms of /. Now choice of a set in £ r 
identifies the collection £ r with DifF, as we have seen in §4. DifF is an open subset of Cq 1 (/,R), which is 
how we will write the set of all C r functions from J to R such that /(0) = and f{l) = 1. This in turn is a 
closed affine subspace of C r (I, K). To see this note that, defining 

S$ ;1 (/,M) = {/ e C r (I,R) : /(0) = = /(l)}, 

two functions in Cq i differ exactly by an element of i- Now Bq 1 is a Banach space, with the C r norm. 
Hence £ r is a Banach manifold: it is identified with DifF, which in turn corresponds to an open subset of 
Bq 1 . Now DifF is a group, hence it (and therefore £ r ) is a Banach Lie group. Two choices have been made: 
the choice of a Cantor set in £ r , and of a special point (the identity) in DifF. These choices determined 
the maps to Bq x and hence the metric (inherited from the C r norm) . Both choices moreover amount to the 
same thing: changing C to D in £ r (as in Proposition 5.1, for r = 1) corresponds to a right translation in 
the group DifF. 

Now in a Lie group one ideally would like to work with a (left- or right-) invariant metric. If the group is 
compact (or, more generally, amenable), one can make a given metric invariant (while keeping equivalence) 
by averaging over translations. In our case, however, one cannot get an equivalent invariant metric- DifF 
is not only non-compact but non-amenable! The (non-uniform) bounded equivalence proved in Proposition 
5.1 is nevertheless enough for what we needed, for the proof of Theorem 5.3. 

In summary we have: 

Proposition 8.1. £ r is a Banach manifold. It is naturally identified up to right composition with the 
Banach Lie group Diff r , and with an open subset of a closed affine subspace of C r (I,M). □ 

A similar estimate to that shown in Proposition 5.1 for r = 1 holds for r > 1. Therefore one has, for 
r = k + 7, co, u) : 

Proposition 8.2. 

(a) The C r metric on Diff r is right-invariant up to (non-uniform) bounded equivalence. 

(b) The C r metric on £ r is base-point independent up to (non-uniform) bounded equivalence. 
□ 

We note that the space of marked Cantor sets, is also a Banach manifold, by the same reasoning as 
for DifF: it is an open subset of a closed affine subspace of C r (C, R). 

Next we will describe more fully the relationship between the spaces £ r and £ r . For r = k + 7, we write 
DifiJ(J) for the collection of C r diffeomorphisms of the unit interval / whose first k derivatives are 1, 0, . . . , 
at the endpoints. This is also a Banach manifold. 

Proposition 8.3. Given the choice of a Cantor set C , £ r factors naturally, set theoretically and topologi- 
cal^, as 

£ r = £ r xDiff (I), 

with the topology on £ r defined below. 

Proof. First, let us consider how many C r maps from I to I there arc, conjugating (C, S) with (C, S) (the 
Cantor sets are the same, but the maps may be different off of the Cantor sets). In the proof of Lemma 
7.3, note that instead of starting with $ equal to the identity on the gap G we could have taken any C k+1 
diffcomorphism from G to itself, whose derivatives agree with the identity at the endpoints, up to order 
k. Conversely any conjugacy is specified by its values on G, since elsewhere it is then determined by the 
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dynamics. Therefore we see that the set of C r conjugacies from (C, S) to (C, S) correspond naturally to 
Diffg(7). (This is also true when S = S\) 

Next we consider how many different C r conjugacies are possible in the rigidity theorem, from (C, S) 
to (C, S). Given the existence of one such map and hence a restricted conjugacy, we can define (all the) 
other extensions by a method like that used in the proof just given. That is, we first define the conjugacy 
arbitrarily on the first-level gaps (but with the correct derivatives of order < fc at the endpoints). Then we 
extend by the dynamics. Or, we can quote that statement directly, making use of use a ratio Cantor set as 
intermediary as in the proof of Theorem 7.5, and now replacing $ by one of the more general maps described 
above. 

This shows we have a product of sets. £ r has not yet been given a topology. But from the product 
decomposition, we can define a family of metrics as follows. Choosing one element of Diffjj defines an 
embedding into £ r , and we just use the C r metric there. (One would like to get a more natural definition by 
taking the infimum over all such choices; however, it is then not clear that the triangle inequality will hold). 
At any rate the metrics are equivalent, so this defines a natural topology on £ r . 

We will show that the C r metric on £ r is equivalent to the product of the metrics on £ r and DifiJ . 

It is easy to see that the map from £ r to each factor is continuous. (To DiffJJ it is also affine). For the 
converse, given the base point (C, S), let first /, g G DifF* be such that (C/, Sf) = (C g , S g ). Write fo, go for 
the corresponding elements of Diffjj, i.e. the restrictions of / and g to the middle gap of C (rescaled in the 
range). We claim that if fo and g are close in Diffg then / and g are close in DifF. The formula for / on 
an n th level gap of C is 

/(a) = <fii ...x n °fo° fxl..xja). 

Here ip, tpt denote inverse branches for S and Sf respectively. By assumption tpf = ip 9 . Now by bounded 
distortion (Lemma 6.4) for fc = 1, and for general fc by the proof of Theorem 6.1, D k (pl o Xn is uniformly 
7— Holder with constant independent of n. This proves ||/ — g\\ci is small, which is what we wanted to show. 

Next we drop the assumption that / and g give the same maps. We have chosen an element of DifPg 
to define the metric on £ r . Let /, g denote the maps in DifF such that /o = go is that element, with 
(Cf,Sf) — (Cj, Sf) and similarly for g. Now by definition, the distance between the pairs in £ r is ||/ — fl||ci- 
So we just apply the triangle inequality using the previous case, to conclude that dc{Cf, C g ) = \\f — g\\ci is 
also small. □ 

Remark. In a conversation about the proof of Lemma 7.3, Yair Minsky pointed out to us an interesting 
parallel between that argument and Sullivan's "flexibility and rigidity" theorem for Klcinian groups. Sullivan 
showed, for a finitely generated Kleinian group T, that the limit set A of the group itself is "rigid" , i.e. a 
quasiconformal conjugacy (to another Kleinian group) which lives (Lcbesgue almost-surely) on A must be 
Mobius. This is a consequence of Sullivan's lemma that A carries no measurable T- invariant line fields. (Note 
that by contrast, for hyperbolic Cantor sets, quasisymmetric conjugacy does not imply smooth conjugacy; 
as we have seen, one also needs to know the scaling function). 

Sullivan used this to show that a quasiconformal conjugacy is determined by a Beltrami differential on 
ft/T where ft is the domain of discontinuity. Thus, in a sense, one has rigidity on the limit set and flexibility 
off of it. The group actions correspond to the two (not restricted) expanding maps, and the surface O/T, or 
equivalently a fundamental domain for the action of T on 17, is analogous to the gap G of the Cantor set. 
As is the case there, the conjugacy is then specified elsewhere by the dynamics. Sullivan's theorem can then 
be stated as follows: Tcich(r)= Teich(f2/r), where this refers to the Teichmiiller space of a group and of a 
surface respectively; this formulation led to the statement in the Proposition above. 

Concluding remarks: limit sets as the attractor of a semigroup action. £ 1+7 denotes the C 1+7 
equivalence class of a given C 1+7 hyperbolic Cantor set. The nested subclasses £ r , for maps of smoothness 
r = fc + 7, 00, u, also are conjugate with that higher degree of smoothness. Thus smoothness classes are also 
conjugacy classes. Choosing one set in £ r as a base point, £ r is naturally identified with a topological factor 
of the C r orientation-preserving diffeomorphisms of the interval, DifF which is a Banach manifold. Moreover 
we can choose one Cantor set as a common base point for all the £ r , since by Theorem 7.4 smoothest 
Cantor sets exist. Then the nested collections £ 1+1 D • • • D £ r . . . are naturally identified with factors of 
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Diff 1+7 D ■ ■ ■ D DifF .... (Each is a Banach manifold with its own topology, and is a dense subset of the 
larger collections, with respect to their topologies). The spaces of marked Cantor sets are also Banach 
manifolds. The free semigroup on two generators FS2 acts on each submanifold by replacing it with 
its left or right Cantor subset. From Theorem 7.4, the limit sets are in the intersection of the From 
Theorem 5.3, because the bounds are uniform over all Cantor subsets of level n, the collection of limit sets is 
an attractor for this action. This convergence is exponentially fast in the C 1 norm. (Warning: we have only 
shown convergence in this norm; see the Remark at the end of §6). The semigroup action on the attractor 
itself can be described symbolically very simply as follows. Recall the map y 1— ► C v for y in the dual Cantor 
set S~ and C v the corresponding ratio Cantor set. Now just concatenate y on the right with a finite string 
of symbols. 
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